Agent Skills: Automatic Speech Recognition (ASR)

Transcribe audio segments to text using Whisper models. Use larger models (small, base, medium, large-v3) for better accuracy, or faster-whisper for optimized performance. Always align transcription timestamps with diarization segments for accurate speaker-labeled subtitles.

UncategorizedID: benchflow-ai/skillsbench/Automatic Speech Recognition (ASR)

Install this agent skill to your local

pnpm dlx add-skill https://github.com/benchflow-ai/skillsbench/Automatic Speech Recognition (ASR)

Skill Files

Browse the full folder contents for Automatic Speech Recognition (ASR).

Download Skill

Loading file tree…

Select a file to preview its contents.